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ABSTRACT 

We have developed a parameter-independent method to detect local maxima of the 
i~ h two-point correlation function. By applying it to two samples of rich Abell clusters of 

galaxies with redshift limits z < 0.08 and z < 0.12 we detect three maxima centered at 
150 h~ l Mpc, 300 h~ l Mpc and 430 h~ l Mpc with confidence levels 80% and higher. This 
sequence of fluctuations has an average interval of 140 h~ l Mpc, that can be explained 
by a power spectrum with a distinct peak at k = 0.048 ± 0.005 h Mpc -1 . 
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1. Introduction 

There has been a growing evidence on the presence 
of large-scale inhomogeneities in the distribution of 
matter at scales around 100 h~ l Mpc. The first hints 
came from the distribution of very rich clusters of 
galaxies, observed by Kopylov et al. (1984, 1988) and 
later by Fetisova et al. (1993). They found a sec- 
ondary maximum in the two-point correlation func- 
tion that has been later confirmed by Mo et al. (1992a, 
b) and Einasto & Gramann (1993). However, the sta- 
tistical significance of the secondary maximum in the 
cluster correlation function was never assessed. 

Deep pencil-beam surveys have revealed density 
peaks in the distribution of galaxies with a surpris- 
ingly regular spacing of about 128 h~ l Mpc (Broad- 
hurst et al. 1990). These peaks were attributed to the 
locations of superclusters as defined by rich clusters 
of galaxies (Bahcall 1991). The power spectrum of 
the density distribution in this one-dimensional sur- 
vey has a sharp maximum at about 130 h~ l Mpc. 
Whether this feature is present in three-dimensional 
space is less clear, since redshift surveys of galaxies 
up to z « 0.2, covering a large fraction of the sky, are 
only in the planning stage (Gunn & Weinberg 1995). 
However, an analysis of the three-dimensional distri- 
bution of superclusters suggests indeed the presence 
of the characteristic scale at about 130 h~ l Mpc in 
the supercluster-void network (Einasto et al. 1994). 

Such a scale, if present, should also be reflected 
in the power spectrum. The power spectrum of mat- 
ter has a positive index n Ri 1 on very large scales 
as suggested by theoretical arguments and confirmed 
by COBE (Smoot et al. 1992), and a negative index 
— 2 < n < — Ion small scales, as emphasized by re- 
cent studies of the distribution of galaxies of vari- 
ous types. Thus there must be a transition on in- 
termediate scales which can be identified as a peak. 
The Cf A Redshift Survey of galaxies shows a turnover 
at A « 150 h~ l Mpc, albeit with a small statistical 
significance (Vogeley et al. 1992). A comparison of 
IRAS galaxies with Abell clusters and radio galaxies 
suggests a maximum at 160 h~ l Mpc (Mo, Peacock 
& Xia 1992). The maximum is also detected in the 
distribution of rich Abell clusters (R > 1) at wave- 
lengths A > 100 h~ l Mpc (Peacock & West 1992, 
hereafter PW), as well as in the whole Abell/ ACO 
cluster sample (R > 0) (Einasto et al. 1993). In 
contrast, no turnover has been found in the recent 
CfA Redshift Survey, though the spectrum flattens 



at A > 120 h~ l Mpc (Park et al. 1994). As the shape 
of the spectrum has an influence on the matter-void 
network (Frisch et al. 1995), it should be possible to 
use the distribution of objects on large scales to de- 
termine the behavior of the power spectrum for these 
wavelengths. 

Although most of these results support the pres- 
ence of large-scale inhomogeneities, the statistical sig- 
nificance of this feature is unclear. The order of the 
characteristic scale found so far is in agreement with 
pencil-beam surveys and can be expected at about 
130 h~ l Mpc (Dekel et al. 1992). 

The purpose of this paper is to investigate the pres- 
ence of inhomogeneities in three-dimensional density 
distributions and to estimate the statistical signifi- 
cance of the result. We shall base our analysis on the 
Abell/ ACO catalog of rich clusters of galaxies which 
is the deepest available complete survey covering a 
large fraction of the sky. The paper is organized as 
follows: in section 2 we describe the observational 
data used, in section 3 we develop a novel method for 
the detection of maxima of the two-point correlation 
function, in section 4 we present our analysis and the 
main results are summarized in the conclusions. 

2. Data 

We shall base our analysis on the Abell/ ACO cat- 
alog of rich clusters of galaxies (Abell 1958; Abell, 
Corwin & Olowin 1989). The redshift information for 
these clusters has been continuously updated by two 
of us (ET & HA; cf. Andernach et al. 1994). 

It has been pointed out that this catalog is biased 
by projection effects (Sutherland & Efstathiou 1991). 
However, this effect is present mostly in the sample 
that includes all clusters (R > 0), and absent for the 
R > 1 subsample, for which there is no evidence that 
the close pairs of clusters reflect anything other than 
true spatial correlations (PW). 

We shall first consider the same sample used by 
Peacock & West (1992), but exclude low galactic lat- 
itudes to ensure better completeness for higher red- 
shifts. The limiting criteria are: 

R>1, 0.01 < z < 0.08, 

&>40°, &<-30°. 

This selection includes 175 clusters with measured 
redshifts and 3 clusters with estimated redshifts. For 
the estimation of photometric redshifts we used the 
formulae proposed by Peacock & West (1992). 
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We shall also consider a sample with the same se- 
lection parameters but using a higher redshift limit. 
Since the catalog becomes largely incomplete in mea- 
sured redshifts at about z « 0.13, we limit the second 
sample to 0.01 < z < 0.12. This includes 382 mea- 
sured redshifts and 111 estimated ones, giving about 
78% for completeness in redshifts. 

The number density of clusters within 0.01 < z < 
0.08 is 1.6 times higher than within 0.08 < z < 0.12, 
indicating that there might be present a selection ef- 
fect. 

2.1. Uncertainties 

The uncertainties in deriving the cluster distances 
from their redshifts can be roughly divided into two 
categories - those for clusters with measured redshifts 
(the main bulk of the data) and those for clusters with 
distances estimated from their luminosity function. 

For clusters with measured redshifts the highest 
uncertainties are caused by peculiar velocities, both 
of galaxies within a cluster and of clusters as a whole. 
The uncertainties caused by peculiar velocities of 
galaxies can be corrected to some extent by using red- 
shift measurements of different galaxies in the same 
cluster. This will reduce the errors by a factor of 
■\/N z , where N z is the number of measured redshifts 
in the particular cluster. The average velocity dis- 
persion has been found between 657 km/s for R > 0, 
758 km/s for R > 1 and 989 km/s for R > 2 clusters 
(Girardi et al. 1993). 

Another error of the same order is due to the pecu- 
liar velocities of clusters. The Local Group has been 
found to move at 600 km/s relative to the Hubble flow 
(Rowan-Robinson et al. 1990). An analysis of nearby 
Abell clusters gave an average velocity dispersion of 
600±105 km/s (Postman, Huchra & Geller 1992). On 
the basis of these results, we use an error estimate 

a c = 600 km/s, (T g = 800 km/s, 

<T 2 v =<T 2 c +<T 2 g /N z . (1) 

Additional uncertainties arise from the spurious 
assignment of redshifts of foreground or background 
galaxies to clusters. One can assume these clusters 
to have large discrepancies \z est — z b s \ and small N z . 
We found 21 clusters affecting our samples which had 
measured redshifts more than twice different from the 
estimated ones, 19 of which had measured redshifts 
less than the estimates. For these clusters we use 



only the estimated redshifts. The numbers of clusters 
given above already include this correction. 

The error in estimated redshifts is much larger, and 
is found to be about 27% for the northern sample 
and 18% for the south (PW). Since the magnitude- 
redshift relation is determined in the In z ~ m scale, 
the errors should have a normal distribution relative 
to In z. We built the regression of In z b s on In z est 
for R > 1 clusters in the range 0.01 < z < 0.12. The 
rms error of the regression was 

a(ln z) = 0.27. (2) 

This leads to slightly larger deviations towards longer 
distances (31%) than towards us (24%). 

3. Method 

The two-point correlation function is traditionally 
determined by binning the distribution of distances 
between sample objects. This leads to a dependence 
of the results on the chosen bin size. Even if the 
average correlation function £ can point out general 
trends, the standard error estimate depends on the 
width of the bin and should not be used (see Fig. 1). 
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Fig. 1. — Two-point correlation function of the larger 
sample (0.01 < z < 0.12). The thin line shows the 
convolution with a 5 h~ l Mpc triangular kernel, the 
thick line is the convolution with a 20 h~ l Mpc kernel. 
The negative trend of £ is probably due to the incom- 
pleteness of the cluster catalog at high redshifts (see 
Fig. 2). The dependence of deviations on the smooth- 
ing scale is evident. 

However, there exists a precise function of the 
data that does not depend on arbitrary parameters, 



3 



namely the cumulative distribution of distances of 
cluster pairs. It has the disadvantage of being not 
illustrative and it is not clear how to find inhomo- 
geneities in this distribution. 

We estimate the correlation function as 

C(r) = dP cc (r)/dP rr (r)-l, (3) 

where dP cc is the probability density of distances of 
cluster pairs and dP rr is the probability density of a 
Poisson distribution in the same volume. This gives 
us an expression for the cumulative distribution of 
distances of cluster pairs: 

Pcc(L)= f (£(r) + l)dP rr (r). (4) 
■Jo 

Since our data consist of discrete points, the integral 
changes to a sum that can be done in an arbitrary 
interval: 

r=L2 r=L2 

£ AP cc (r)= £(£(r) + l)AP rr (r). (5) 

r=Ll r=Ll 

The left-hand side of this equation is independent of 
binning as it just counts all pairs with mutual dis- 
tances between LI and L2. The term AP rr does the 
same on the right-hand side, but it has an additional 
factor £ (r) . Thus in order to use the cumulative distri- 
bution function we have to parametrize the two-point 
correlation function. This might be difficult to do for 
the full range of distances, but is certainly applicable 
for a selected range. As we wish to detect maxima 
of the correlation function, we use equation (5) for a 
limited distance interval and approximate the behav- 
ior of £ in the interval with a parabola. This leaves 
no binning-dependent terms in the formula. 

The procedure goes as follows. We fix the range 
(LI, L2). If £ has a maximum somewhere between LI 
and L2, then the least-squares fit gives a parabola 
with a maximum in this range. There could also be 
a minimum within the range. If there is neither max- 
imum nor minimum, the parabola has an extremum 
outside of the selected range. We shall also check the 
goodness of the fit, since the parabola might be a bad 
approximation (eg., if there are more than one max- 
ima). The criterion for this is cr cr = 1/N p , where N p 
is the number of distances in the chosen interval. As 
<7 cr is the increase of P cc per cluster pair, fits giving 
a residual dispersion a greater than cr cr have higher 
deviations than the signal and will be discarded. 



We check all possible combinations of LI and L2, 
looking for local maxima at all intervals and at all 
scales within the sample volume. For each interval we 
perform a bootstrap resampling (Efron 1979; Efron 
& Tibshirani 1986) of our cluster catalog by selecting 
clusters randomly with replacement. This procedure 
will usually select about 1/3 of the clusters more than 
once, and while it is acceptable for many statistics, 
the repeated clusters cannot be used to estimate the 
distance distribution of cluster pairs. One possible so- 
lution is to smooth the observed distance distribution 
and to resample distances from the smoothed distri- 
bution instead of resampling clusters (Silverman & 
Young 1987). Another one is to continue resampling 
from the original data set, perturbing positions of the 
sampled clusters (Kendall & Kendall 1980). We did 
not use either of these methods. The first possibil- 
ity may give a too small variance and the chosen set 
of pairs would not correspond to any particular dis- 
tribution of clusters. For the second one we did not 
find a smoothing distribution that would not induce 
a smearing scale in the correlation function. We de- 
cided to count repeated clusters as being single, that 
gave us sample sizes about 2/3 of the original dataset. 
The average distance distribution of these samples co- 
incides with the observed distance distribution, and 
the result of our choice could only be overestimating 
the variance of the distribution that is at least not 
misleading. 

After selecting the subsample, we shift clusters by 
distance uncertainties selected from the normal dis- 
tribution (see eq. [1]) for clusters with measured red- 
shifts and from the lognormal distribution with the 
rms error given by eq. [2] for clusters with estimated 
redshifts. For each bootstrap sample we perform the 
analysis described above. The bootstrap resampling 
is repeated 1000 times for each interval and fits with 
an average a greater than discarded. 

4. Analysis 

Our method is robust in the sense that it does not 
depend on the magnitude of £, but only on its local 
oscillations. This gives us the freedom not to correct 
for any global trends, such as selection effects. In 
order to see how a possible decrease of the cluster 
number density with distance changes the correlation 
function, we calculated £ for a sphere in which the 
density changed linearly with radius, falling to half 
its central value at the edges of the sphere (Fig. 2). 
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The result shows clearly that this causes a large-scale 
trend without local fluctuations and may induce a 
maximum within an interval that is about a half of 
the size of the whole volume. 

0.2 I I 
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Fig. 2. — Correlation function of a simulated selection 
effect within a sphere with diameter D = 1. The 
matter density falls to half its central value at the 
edges of the sphere. 

Let us first consider possible results. We may find 
no maximum in the selected interval, in which case 
we can not say if it is missing or if our method is 
simply too insensitive to detect it. Secondly, there 
may be a maximum detected at a very low confidence 
level (around 50%), in which case it is caused by few 
clusters, thus being a feature of the particular sample. 
Thirdly, if a maximum is found with a confidence level 
near 100%, then it is not sensitive to local variations 
and should exist in all samples, thus in both of the 
cluster samples. 

We shall first consider the nearby sample that ex- 
tends to 520 h~ l Mpc. As the zero-point of the cor- 
relation function has been found around 50 h~ l Mpc 
(Klypin & Rhee 1994), we expect no secondary max- 
ima at smaller distances. We change the limits of the 
interval with a step of 5 h~ l Mpc and plot the posi- 
tions of maxima for all intervals with confidence levels 
50% and higher against the search interval (L2 — LI) 
in Fig. 3. This plot shows a lot of substructure but it 
is evident that there is at least one maximum between 
100 and 200 h~ l Mpc. The highest confidence level 
is 98% for a maximum centered at 135 h~ l Mpc on a 
160 h~ l Mpc search interval. The second highest con- 
fidence level is 83% for a maximum at 440 h~ l Mpc 
on an interval of 120 h~ l Mpc. 
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Fig. 3. — Positions of maxima for search intervals 
with confidence level higher than 50% for the smaller 
sample, plotted against the width of the interval. This 
scatter diagram shows the distribution of maxima 
within all selected intervals. 

The volume of the second sample is about 4 times 
larger than that of the first, so we expect it to hide the 
local features of the smaller sample. Unfortunately 
about 1/5-th of the clusters have redshift uncertain- 
ties around 30% that may make some maxima unde- 
tectable. The results are given in Fig. 4. The confi- 
dence level of the most significant maxima in the first 
sample has lowered and is 89%, perhaps due to the 
inclusion of clusters with high distance uncertainties. 
The highest confidence level in this sample is 94% for 
a maximum at 300 h~ l Mpc on intervals of 100 - 120 
h~ l Mpc. The average position of the first maximum 
has shifted to 150 h~ l Mpc . The confidence level of 
the third maximum is 72% at 425 h~ l Mpc on a 100 
h~ l Mpc search interval. There seems to be a fourth 
maximum centered at 515 h~ l Mpc, but since it has a 
confidence level of only 56%, we shall not consider it 
in the analysis. An important feature of the maxima 
is that they extend from intervals close to zero up to 
certain limits AL max , thus being independent on the 
width of the search interval. 

There is no doubt that the two strongest maxima 
are not local features of the samples. The presence 
of smaller maxima is less clear, but those that show 
up in both samples may well be present in general. 
There is no sign of maxima at larger scales — the 
widest maxima we found had a width of about 220 
h~ l Mpc, and the check was done up to scales of 800 
h~ l Mpc. The results also show a periodic sequence 
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Fig. 4. — The scatter diagram shows positions of max- 
ima for search intervals with confidence level higher 
than 50% for the larger sample. No maxima were 
found outside of the ranges of the plot. 

of maxima with a step of about 140 h~ l Mpc and a 
possible sequence of weaker maxima with about twice 
shorter wavelength. As we can not check whether the 
weaker maxima are also repeated at the positions of 
the main sequence or whether they occur only be- 
tween the main maxima, we shall base our conclu- 
sions mainly on the sequence of maxima with highest 
amplitudes. 

10 5 f i 
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Fig. 5. — Power spectra fixed on small and large 
scales. The long-dashed line is the fit of Peacock & 
West (1992), the short-dashed line is for a smooth and 
the solid line for a sharp maximum at 0.048 h Mpc -1 . 

It is interesting to note that a similar behavior of 
the two-point correlation function can be predicted 



by a simple model of the power spectrum. We take 
the form of the power spectrum used by Peacock & 
West (1992), 

P(k) = Ak/(l + (k/k ) 1 - n ), (6) 

where the fit to Abell clusters gave ko = 0.025 h 
Mpc -1 and n = —1.4 (PW). The correlation func- 
tion can be found by 

Z(r) = 4tt / P(k) k 2 ^-^- dk (7) 

Jo kr 

and is plotted in Fig. 6. It does not exhibit the repet- 
itive pattern, but fluctuations can be produced by us- 
ing the same spectrum with a sharper maximum. By 
fixing the small- and large-scale end of the spectrum 
to match the fit (PW), the period of the oscillation is 
given by the position of the maximum as A = 2ir/k m 
if the peak is sharp. As shown on Fig. 5-6, the most 
suitable spectrum has a distint peak. Smoother max- 
ima give either smaller amplitudes or no oscillations 
at all, not depending on the height. The position of 
the peak is restricted to 0.043 < k m < 0.053 h Mpc -1 
by the data. 
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Fig. 6. — Correlation functions corresponding to 
power spectra on Fig. 5. The notation is the same as 
used on the previous figure. Notice how strong effect 
the sharpness of the maximum of the power spectrum 
has on the fluctuations of £. 

Since the amplitude of the sequence of fluctuations 
decreases with distance, it may be impossible to de- 
tect maxima at very large separations. This agrees 
with our results as we did not find any maxima at 
R > 600 h~ l Mpc. 
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5. Conclusions 

We have found a sequence of maxima in the two- 
point correlation function of rich clusters of galaxies 
which can not be explained by peculiarities of the 
samples. The main sequence has maxima at 110 — 180, 
270 - 330 and 400 - 450 h~ l Mpc, which limits their 
spacing to the range 110 < X p < 170 h~ l Mpc. The 
average value of X p is 140 h~ l Mpc in a very good 
agreement with 130 h~ l Mpc found from the pencil- 
beam surveys. 

Although we detect a periodic behavior of the cor- 
relation function, it is not clear whether there is a sim- 
ple geometry in the distribution of matter. It has been 
shown that distributions based on Voronoi polygons 
give rise to a characteristic scale (Ikeuchi & Turner 
1991), but they can not produce this scale and the 
observed small-scale amplitude of £ simultaneously 
(Williams, Peacock & Heavens 1991). 

Our results set limits to the position of the max- 
imum of the power spectrum P(k). In order to pro- 
duce the fluctuation amplitude and period found from 
the data, the spectrum must contain a distinct peak 
at k m = 0.048 ± 0.005 h Mpc" 1 (A m = 120 - 145 
h~ l Mpc). This result is in agreement with upper 
limits for k m (A m = 100 — 160 h~ l Mpc) found by 
different surveys, strongly confining the overall shape 
of the spectrum. 
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